Sorting Email Messages by Topic Project for CPSC 503 and CPSC 532b
نویسنده
چکیده
The problem of categorizing a document by topic can be divided into three steps. First, the document’s text must be converted into potential features. From these many potential features a few receive promotion to feature status during the feature selection step. Finally, a classifier learns to categorize documents into topics based upon these features. BUCFE, Bottom-Up Combinatorial Feature Extraction, adds to the commonly used individual words syntactically parsed sentence fragments as potential features. Here, a neural network using relevant term selection and the Winnow algorithm make use of BUCFE’s suggestions. The problem domain of categorizing email messages into mailboxes provides a real-world test bed.
منابع مشابه
CPSC 503 - Final project Part-of-speech filtering in unsupervised learning to improve discourse relation recognition
I evaluate a technique for improving the accuracy of discourse relation recognition by unsupervised classifiers that involves filtering the input features based upon their parts of speech. I report on experiments on various corpora and training set sizes in which classifiers trained on filtered features are less accurate than equivalent classifiers trained on unfiltered features.
متن کامل"I'm from the government and I'm here to help".
Since its inception, the U.S. Consumer Product Safety Commission (“CPSC”) has encouraged companies to implement active product safety management programs. Since 2010, however, the CPSC has made this a bit more official. Requirements for the establishment of safety compliance programs has appeared in a final rule of factors to be considered for civil penalties, in a number of consent decrees and...
متن کامل[Comparative study of Acid extraction tests of metal products containing lead].
The international standard ISO 8124-3: 1997 "Safety of toys -Part 3: Migration of certain elements" and "Interim Enforcement Policy for Children's Metal Jewelry Containing Lead- 2/3/2005" by the U.S. Consumer Product Safety Commission (CPSC) to control the amount of eluted lead from metal accessories cannot be simply compared, because the acid extraction methods and the limit values are differe...
متن کاملConscious Presence and Self Control as a measure of situational awareness in soldiers – A validation study
UNLABELLED BACKGROUND The concept of `mindfulness´ was operationalized primarily for patients with chronic stressors, while it is rarely used in reference to soldiers. We intended to validate a modified instrument on the basis of the Freiburg Mindfulness Inventory (FMI) to measure soldiers' situational awareness ("mindfulness") in stressful situations/missions. The instrument we will explore...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999